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Abstract — Glauber dynamics is a powerful tool to generate 
randomized, approximate solutions to combinatorially difficult 
problems. It has been used to analyze and design distributed 
CSMA (Carrier Sense Multiple Access) scheduling algorithms 
for multi-hop wireless networks. In this paper we derive bounds 
on the mixing time of a generalization of Glauber dynamics where 
multiple links are allowed to update their states in parallel and 
the fugacity of each link can be different. The results can be 
used to prove that the average queue length (and hence, the 
delay) under the parallel Glauber dynamics based CSMA grows 
polynomially in the number of links for wireless networks with 
bounded-degree interference graphs when the arrival rate lies in 
a fraction of the capacity region. We also show that in specific 
network topologies, the low-delay capacity region can be further 
improved. 

I. Introduction 

In wireless networks, the links (transmitter-receiver pairs) 
may not be able to transmit simultaneously due to interference. 
A scheduling algorithm, or MAC (Medium Access Control) 
protocol, determines which links can access the medium in 
each time instant so that no active links interfere with each 
other. Since many wireless network applications today have 
stringent bandwidth and delay requirements but the resources 
(e.g., spectrum, power) are often quite limited in a wireless 
setting, designing low-complexity scheduling algorithms to 
achieve high throughput and low delay is of great importance. 

It is well known that the queue-length based Maximum 
Weighted Scheduling (MWS) algorithm is throughput-optimal 
[21 1, meaning that it can stabilize the network queues for all 
arrival rates in the capacity region of the network. However, 
MWS requires the network to select a max-weight independent 
set in the interference graph in every time slot, which is NP- 
hard for general interference graphs [5|. There exist several 
low-complexity alternatives such as Maximal Scheduling and 
Greedy Maximal Scheduling, but in general these algorithms 
can only guarantee to achieve a fraction of the capacity region 
(see lfl2l and references therein). 

Due to their simplicity, random access type scheduling 
algorithms such as Aloha and Carrier Sense Multiple Access 
(CSMA) are widely used in practice. Performance analysis of 
random access algorithms in single-hop wireless networks can 
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be found in HI . In Q the authors introduced a continuous-time 
Markov chain model to analyze the performance of a (fixed- 
parameter) CSMA algorithm in multi-hop wireless networks, 
and it was shown that the stationary distribution over the 
schedules has a product form. The model was used in ll23l to 
study throughput and fairness issues of the CSMA algorithm, 
and its insensitivity properties were studied in lfl3ll . A discrete- 
time version of the algorithm was studied in fl6l . flTl . 

In iflOl the authors proposed an adaptive CSMA algorithm 
where the links adaptively adjust their parameters based on 
locally measured arrival and service rates. The algorithm was 
shown to be throughput-optimal under a time-scale separation 
assumption (the CSMA Markov chain converges to its steady- 
state distribution instantaneously compared to the time-scale 
of adaptation of the CSMA parameters) which can be justified 
using a stochastic-approximation argument fl4l . ifTTl . In |[l~8ll 
the authors established throughput-optimality of their adaptive 
CSMA algorithm without the time-scale separation assumption 
by choosing the link parameters to be slowly varying functions 
of the queue lengths. The discrete-time equivalent of this 
analysis for the model in fl6l . IfTTIl appears in J6|. 

Central to these CSMA algorithms is the so-called Glauber 
dynamics, which is a Markov Chain Monte Carlo method 
that can be used to sample the independent sets of a graph 
according to a product-form distribution |4j, 11221 . Under 
traditional Glauber dynamics, in each time slot one link will 
be selected uniformly at random, and only that link can change 
its state while other links will keep their states unchanged. For 
the chosen link, if all of its neighboring links were in state 
(inactive) in the previous slot, then the link will choose 
to be in state 1 (active) with probability an d in state 
with probability j^j', otherwise (i.e., if at least one of its 
neighboring links was in state 1 in the previous slot), the link 
will choose to be in state definitely. In statistical physics, the 
parameter A is called the fugacity since it indicates how likely a 
selected site will change its state Q Glauber dynamics has many 
applications in statistical physics, graph coloring, approximate 
counting, and combinatorial optimization (e.g., [4|, 1 15], |22|). 
In these applications, the performance of the Glauber dynamics 
is often determined by how fast the Markov chain converges to 

1 And the same fugacity A is shared by all links under the traditional Glauber 
dynamics. In the current paper, however, we need to consider heterogeneous 
fugacities. That is, different links have different fugacities. 
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the stationary distribution. The Glauber dynamics is said to be 
fast mixing if the mixing time (will be defined formally later) 
grows polynomially in the size of the graph. The algorithm 
in fl6l . ifTTl is a generalization of Glauber dynamics where 
multiple links are allowed to update their states in parallel. 
The CSMA algorithm in (2) can be viewed as continuous- 
time Glauber dynamics with fixed parameters, and the CSMA 
algorithms in [10], ifTSl can be viewed as continuous-time 
Glauber dynamics with adaptive parameters. In this paper, 
we focus on the discrete-time CSMA algorithm suggested in 
[16 1, fTD . An important feature of this algorithm is that the 
overhead for signaling is constant (independent of the size of 
the network) even taking into account the collisions during the 
signaling phase. The results in this paper can be extended to 
other versions of CSMA algorithms as well. 

The recently proposed Glauber dynamics based CSMA 
algorithms have made an important progress to the design of 
low-complexity distributed scheduling algorithms to achieve 
maximum throughput in wireless networks. On the other hand, 
the delay performance of these CSMA algorithms has not 
been well understood. As shown in a recent work EOl . for 
general networks, it may not be possible to design low- 
complexity scheduling algorithms which can achieve both low 
delay (i.e., grows polynomially in the number of links) and 
even a diminishingly small fraction (i.e., approaches zero when 
the number of links increases) of the capacity region unless 
NP C BPP or P = NP. In this paper we prove a positive 
result. We show that our parallel Glauber dynamics based 
CSMA scheduling algorithm can achieve both low delay and 
a fraction (independent of the size of the network) of the 
capacity region when the interference graphs satisfy certain 
properties. 

The main contributions of this paper include: 

> We analyze the mixing time of Glauber dynamics with 
parallel updates and heterogenous fugacities. We derive 
various conditions on the system parameters such as 
fugacities, vertex degrees and update probabilities, under 
which the mixing time grows logarithmically in the 
number of vertices. 

> Based on the above mixing time results, we show that, 
for wireless networks with bounded-degree interference 
graphs, the parallel Glauber dynamics based CSMA algo- 
rithm can achieve a small queue length (O(logn) where 
n is the number of links in the network) at each link if 
the arrival rate lies within a fraction (independent of n) 
of the capacity region and the fugacities of the links are 
appropriately chosen and fixed. Moreover, we consider 
an adaptive version of the CSMA algorithm where the 
fugacities of the links are adjusted based on local queue 
length information. We show that the total queue length 
in the network grows polynomially (0(n 3 log n)) under 
the adaptive CSMA algorithm. 

• Unlike prior analysis of Glauber dynamics based CSMA 
algorithms which uses the conductance method to obtain 
exponential bounds on the mixing time and the queue 
lengths, here we use the coupling method which allows 



us to obtain polynomial bounds on the queue lengths for 
a fraction of the capacity region. 
• For a special but important network topology (Wireless 
LANs), we show that Glauber dynamics based CSMA 
can support the full capacity region with linear mixing 
time. 

In a related work fl9l . the authors proposed a CSMA algo- 
rithm which can achieve order-optimal delay performance (i.e., 
the per-node delay is bounded by a constant) for networks with 
polynomial growth structure (the number of r-hop neighbors 
of any node is bounded by a polynomial of r). The key idea is 
to (periodically) partition the network into sub-networks with 
bounded number of nodes in each sub-network such that the 
mixing time of the Markov chain of the schedule in each sub- 
network is bounded by a constant. However, the partition is 
realized by a distributed coloring algorithm which requires the 
nodes to exchange messages with their multi-hop neighbors, 
and this overhead can be significant and may increase with 
the size of the network, especially when the discrete nature of 
signaling is considered. So a direct comparison of our results 
and the results in |fl9l appears to be difficult. 

The paper is organized as follows. In Section|II]we introduce 
a CSMA scheduling algorithm based on a generalization of 
Glauber dynamics with parallel updates and heterogenous 
fugacities. In Section [HI] we derive bounds on the mixing 
time of parallel Glauber dynamics, with the proof presented 
in Section ITVl In Section [V] we analyze the delay performance 
of the CSMA scheduling algorithms using the mixing time 
results. Section [VI] is dedicated to the analysis of complete 
interference graphs. The paper is concluded in Section IVTT1 

II. CSMA Scheduling Based on Parallel Glauber 

DYNAMICS 

For a wireless link i, we use A/i to denote the set of 
conflicting links (called conflict set or neighbor set) of link 
i: if any link in A/i is active (transmitting), then link i cannot 
be active. 

The interference relationship among the wireless links can 
be represented by the so-called interference graph (or conflict 
graph) G — (V, E), where the vertices in V represent wireless 
links in the network, and there is an edge between two vertices 
in G if the corresponding wireless links interfere with each 
other. (For example, FigUta) shows a wireless network of 3 
links, where link 2 interferes with links 1 and 3, and links 
1 and 3 don't interfere with each other. So, the associated 
interference graph is shown in Fig[TJb), and the conflict set 
of link 2 is A/a = {1,3}, etc.) An independent set of G is a 
subset of the vertices in V where no two vertices are neighbors 
of each other. Let I be the set of all independent sets of G. 

A feasible schedule of the network is a set of wireless links 
that can be active at the same time according to the conflict 
set constraint, i.e., no two links in a feasible schedule conflict 
with each other. This corresponds to an independent set of 
vertices in the interference graph. (In Fig02b), for example, 
the sets {1}, {1, 3}, {2} are all feasible schedules.) We will 
use links and vertices interchangeably throughout this paper. 
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(a) A wireless network of 3 links. (b) The interference graph. 

Fig. 1. An example network and the corresponding interference graph. 

Suppose |V| = n. We can represent a feasible schedule by 
a vector a of the form (<Ji)iey, with Cj e X = {0, 1} for all 
i G V. For a link i and a schedule er, we say link i is included 
in the schedule (written as i £ a) if Oi = 1. Note that a is a 
feasible schedule if the set {i G V : o~i = 1} is an independent 
set of G, i.e., if er, + a 3 < 1, for all (i, j) 6 £. Let fi C AT" 
be the set of all feasible schedules on G. 

We consider a time-slotted system. A scheduling algorithm 
is a procedure to decide which schedule to be used in every 
time slot for data transmission. In [16| we proposed a schedul- 
ing algorithm based on a generalization of Glauber dynamics 
(called parallel Glauber dynamics) where multiple links are 
allowed to update their states in a single time slot. The key 
idea is that in every time slot, we select an independent set of 
links m E I to update their states according to a randomized 
procedure, i.e., we select m 6 X with probability q m , where 
Emgi*?™ = 1- We call m the decision schedule^ 

The parallel Glauber dynamics is formally described as 
follows. 



Parallel Glauber Dynamics (in Time Slot t) 



1. Randomly choose a decision schedule m(i) £ T with 
probability q m(t) . 

2. For every link i G m(<): 

if E i6 M^(*- 1 ) = ° 

(a) a-i{t) = 1 with probability pi = j^-- 

(b) Cj (t) = with probability pi = . 
Else 

(C) £Tj(t) = 0. 

For every link j ^ m(t) : 

(d)<7 J (t) = <7 J (t-l). 



Under the Parallel Glauber Dynamics based CSMA (called 
PGD-CSMA for short), a(t) is used as the transmission 
schedule in time slot t: link i will transmit a data packet 
if <Ji(t) — 1, and will keep silent if ai(t) = 0. Note that 
link i knows whether YljeM ~~ -0 = by conducting 

2 A distributed mechanism to generate the decision schedule, as suggested 
in |' 16'], is the following. At the beginning of a slot, each link independently 
transmits a short INTENT message with probability a £ (0, 1). A link is 
included in the decision schedule if (and only if) it sends an INTENT message 
while none of its neighbors sends such a message. 



carrier sensing in time slot t—1: the channel (medium) will be 
sensed idle if none of its neighboring links were transmitting 
(i.e., 2~2jejv- a j{t — 1) = 0)- Pi i s called the link activation 
probability, which is determined by the fugacity A, of link i. 

We can show that if the transmission schedule used in the 
previous slot and the decision schedule used in the current slot 
both are feasible, then the transmission schedule generated in 
the current slot is also feasible 1161 . Moreover, given the fugac- 
ities Aj's, because a(t) only depends on the previous schedule 
<r(t — 1) and some randomly selected decision schedule m(i), 
a(t) evolves as a discrete-time Markov chain (DTMC). 

Theorem 1: ( H16V ) A necessary and sufficient condition for 
the parallel Glauber dynamics to be irreducible and aperiodic 
is U me i :9m> o m = V, or equivalently, the probability of 
selecting link i in the decision schedule :— ^2 m3i q m > 
for all i 6 V. In this case the Markov chain is reversible and 
has the following product-form stationary distribution: 

( ) = v fr F' (1) 

Based on the product-form distribution, one can establish 
throughput-optimality of PGD-CSMA by either choosing the 
link activation probabilities (fugacities) as appropriate increas- 
ing functions of the (time-varying) queue lengths, or adjusting 
the fugacities based on the measured arrival and service rates, 
as in [i(J|, lfl6]| , ifTHIl . The focus of this paper is to analyze 
the mixing time of parallel Glauber dynamics and the delay 
performance of PGD-CSMA. We will show that the parallel 
Glauber dynamics is fast mixing when the fugacities satisfy 
certain conditions, and this implies that PGD-CSMA induces 
small queue lengths when the arrival rates lie in a fraction of 
the capacity region. 

III. Mixing Time of Parallel Glauber Dynamics 
A. Definitions 

Consider a finite-state, irreducible, aperiodic Markov chain 
(P, SI, 7r) where P denotes the transition matrix, £1 denotes the 
state space, and 7r denotes the unique stationary distribution. 
First we describe a notion of distance between distributions. 

Definition 1: The variation distance between two distribu- 
tions fi, pi on f2 is defined as 

\\p - p'\\var = 2 ^2 ~ /A 1 ) I- ( 2 ) 

Note that < \\p — p'\\ V ar < 1 and \\p, — fi'\\ va r = if and 
only if /i = fj,'. 

Definition 2: The mixing time T m i x of the Markov chain is 
defined as the time required for the Markov chain to get close 
to the stationary distribution. More precisely, 

T m ix = maxinf it : \\p x ,t - K\\ var < -\ (3) 
xen L eJ 

where /Lt X) j is the distribution of the Markov chain at time t if 
the Markov chain starts with state x. 
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B. Conditions for fast mixing of Parallel Glauber Dynamics 

We now state the main theorem of this section and a 
corollary on the mixing time of parallel Glauber dynamics 
with fixed fugacities A. The proof of the main theorem will 
be presented in the next section. 

We associate a weight f(v) > to each vertex (or link) 
v G V, and call /(•) the weight function. 

Theorem 2: For any positive weight function f(v) ofvGV, 
let m = min„ e y f( v )> M — max^gy f(v), and £ = — . If 



mm 



1vf(v) 



X, 



1 + A, 



-fH > o, 



(4) 



where q v , as defined before, is the probability that vertex v 
is included in the decision schedule, then under the parallel 
Glauber dynamics, 



<min{l,(l--^) t <},Vxea (5) 

Therefore the parallel Glauber dynamics is fast mixing and its 
mixing time is bounded by: 

■M 



T m 



< T ■ — 

- -L mix — 



log«e) 



(6) 



One can specify the weight function /(•) to obtain various 
conditions on the fugacities for fast mixing. We will show one 
of them below which will be used later to analyze the delay 
performance of PGD-CSMA. Other conditions are given in 
Appendix [A] 

Corollary 1: Let m = min^gy M = max„ 6 y ^ and 
£ = — , where d v is the degree of v in the interference graph. 



If A„ < 



where 



for all v € V, then 



T 
1 ni 



= mm 



< T ■ — 

_ mix — 



— log«e) 



d„ - 



weNv 



(7) 



(8) 



Proof: Choose f(v) = ^ 
Glauber dynamics is fast mixing if 




By Theorem [2] the parallel 



> 0. 



To achieve that, we need W £ V, d v — 



w^M v l + A, 



-d w > 



0. It is sufficient to have 
equivalent to X w < 



l+A, 



-d w < 1, Vw G V, which is 



Remark 1: Let A be the maximum vertex degree of G. If 
each link (vertex) sends the INTENT message independently 
with probability 1/2, then (1/2) A+1 < q v < 1, so m > 1 and 
M < A • 2 A+1 . Then we have 



T 



< 



A • 2 A+1 



log(A • 2 A+1 en) 



i.e., the mixing time grows as O(logn) for bounded-degree 
graphs when X v < t^t < ■ , Vu On the other hand, it was 



shown in [8] that traditional single-site Glauber dynamics has 
a mixing time at least f2(nlogn) for bounded-degree graphs. 
Therefore, parallel Glauber dynamics reduces the mixing time 
by an order of magnitude. 

Remark 2: For interference graphs with special structure, 
more relaxed conditions can be obtained to ensure fast mixing. 
This will be discussed in Section [W] 

IV. Proof of Fast Mixing 

This section presents the proof of Theorem [2] In particular, 
we use the coupling method to establish the logarithmic 
mixing time. Readers who are only interested in the main 
results could skip the section without loss of continuity. 

A. Preliminaries 

A useful technique to bound the mixing time of a Markov 
chain is via coupling. 

Definition 3: A coupling of the Markov chain is a stochastic 
process (X(t), Y(t)) onHxJ] such that {X(t)} and {Y(t}} 
marginally are copies the original Markov chain, and if 
X(t) = Y(t), then X(t + 1) = Y(t + 1). 

Let $ be a distance function (metric) defined on O x O, 
which satisfies that for any x,y,z € ft'. (1) y) > 0, with 
equality if and only if x = y; (2) $>(x,y) — $>(y,x); (3) 
$(x, z) < $(x, y) + $(y, z). Let 

= min$(ir, y), D max — max y), D 



D r , 



x¥=y 



x.y 



D r , 



The following theorem (e.g., [4Q can be used to bound the 
mixing time of the Markov chain. 

Theorem 3: Suppose (X(t),Y(t)) is coupling of the 
Markov chain where X(f) has distribution fi t and Y(t) has 
distribution fjf t = it. If there exists some constant /3 < 1 such 
that, for all x,y E Q, 

E[$(X(t + l),Y(t + l))\X(t) = x., Y(t) =y]< /3$(i, y). 

(9) 

Then 

||Mt-7r|Ur < min{l, P*D} (10) 
and the mixing time of the Markov chain is bounded by: 

(ID 



< 



rlog(De)- 


< 


rlog(£>e)i 


log/3- 1 




1-/3 



In general, determining j3 in the contraction condition (O 
is hard since we need to check the condition for all pairs 
of configurations (x,y). In Q the so-called path coupling 
method was introduced by Bubley and Dyer to simplify the 
calculation. Using path coupling, we only need to check the 
contraction condition for certain pairs of configurations. The 
path coupling method is described in the following theorem. 

Theorem 4: Let S C fl x f2 and suppose that for all x,y € 
fi x tt, there exists a path x = Zq, z\, . . . , z r = y between x 
and y such that (zi,zi + \) 6 S for < I < r and <f>(x,y) = 
®( z h z i+i)- Suppose (X(t),Y(t)) is a coupling of the 
Markov chain as in Theorem \3\ If there exists /3 < 1 such 
that for any {x,y) £ S, the contraction condition (O holds, 
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then we have ( 17 Oj and the mixing time of the Markov chain 
is bounded as in ([77}. 

Note that the key simplification in the path coupling theorem 
(Theorem 2]i, compared to the coupling theorem (Theorem [3), 
is that the contraction condition (0 needs to hold only for 
(x, y) € S, instead of (x, y) E x f2. 

B. Proof of Fast Mixing 

Now we analyze the mixing time of Glauber dynamics with 
parallel updates and heterogenous fugacities using the path 
coupling technique. We use the following distance function 
between feasible schedules. For any cr, r\ E ft, let 



Since, cr and r\ differ only at v, this also means that a w = 
for all w E J\f v . We have 



(12) 



v£L(7 A77 



where f(v) > is the weight function of v E V and a A 77 = 
(cr \ r)) U (77 \ <j) is the symmetric difference between a and 
77. Note that the distance function is a weighted Hamming 
distance function and satisfies all the properties of a metric. 

Consider the following coupling (cr(f), 77(f)): in every time 
slot both chains select the same decision schedule and use the 
same coin toss for every vertex in the decision schedule if that 
vertex can be added to both schedules. 

Let E[A$>(<j,r])] be the (conditional) expected change of 
the the distance between the states of the two Markov chains 
{cr(f)} and {77(f)} after one slot: 



£[A$(ct,77)] = 



£J[$(a(f+l),7?(f + l)|a(f) = a, 77(f) =77] 

-$(CT,77). 



For any mel, let 

7?[A m $(cr, 77)] = E[A$(a, 77) |m is the decision schedule]. 

We say that a, r\ E ft are adjacent and we write cr ~ 77 if 
there exists ueF such that a and 77 differ only at v. Let 

5 = {(<7, 77) : <7, 77 E and cr ~ 77} 

be the set of all pairs of adjacent schedules. Note that under 
the distance function defined in (fT2l . for all cr, 77 E O, we can 
find a path cr = To, ri, . . . , ruAjyl = ^ between cr and 77 such 
that (r;, 77+1) E 5 for < / < 0-A77I and 

|<7-Atj|-1 

H^v)= E Hn,n+i). 
1=0 

Lemma 1: Consider a pair of adjacent schedules cr and r\ 
that differ only at v (so $(ct, 77) = f(v)), 

£[A$(cr,77)] < -q v f(v) + V q w -^—f( w ). (13) 

Proof: Without loss of generality, suppose a v = and 
r) v = 1. Note that 77^ = 1 implies that r] w — for all w £ Af v . 



E[A$(<r,rj)] 
E m ^[A m $(cr, 77)] 



q m E[A m $(a, V )} 

mEl 

= E 9- E £[A a <f(cr,77)] - ^ g y S[A*$ (a, 77)]. 

Note that only updates on vertices v and w G can affect 
the value of E[A$>(a, 77)] because updates on other vertices 
will have the same outcomes due to coupling. 

If v is selected for update and since we use the same coin 
toss for both Markov chains, then cr(f + 1) = r)(t + 1) and 

$(cr(f + 1), T)(t + 1)) = 0. Thus £[A l '$(cr, ??)] = -f(v). 

If 777 E Af v is selected for update, under schedule 77, w can 
only take value because w has a neighbor (i.e., v) belongs 
to 77. While under schedule cr, there are two cases: 

(1) if w has a neighbor in cr, then w can only take value 0; 

(2) if w has no neighbors in cr, w can take value 1 with 
probability j^f— and value otherwise. 

Hence for w E AT V , E{A w $(a,r])} < jfaf(w). summing 
up all contributions we have dl3) . ■ 

Now we are ready to prove Theorem [2] 

Proof: (Theorem |2]l For any pair of adjacent schedules 
(cr, 77) E S that differ only at some vertex v E V, from (Tl3T > 
and (01 we have: 

i?[A$(cr,77)] < -9<-—<f>(a,r)), 
where we use the fact that $((7,77) = f(v) < M. Therefore, 

£[$(cr(f+l),77(f+l))|cr(f) = cr, 77(f) = 77] < (l- A) rj) . 

By applying Theorem 0] where /3 = 1 — and D = 7i£, we 
have (0 and the bound in ©. ■ 

V. Low Delay CSMA Scheduling for Wireless 
Networks 

In this section we analyze the delay performance of PGD- 
CSMA. In time slot f, let a,(f) be the number of packets 
arriving at link i, Vi. Assume arrivals occur at the beginning of 
a time slot and are i.i.d. with mean arrival rate E[a,i(t)] = Vi < 
1. For simplicity, we assume Bernoulli arrivals, i.e., a^(f) E 
{0, 1} with Pr(a,(f) = 1) = V{. Note that our results hold for 
more general arrival processes as long as the second moment 
E[af(t)] < oo, Vi. 

Let Qi (f ) be the queue length of link i at the end of time 
slot f. Then it has the following dynamics: 



Qi(t + 1) = [Qi(t) + Oi(t + 1) - <Ti(t + 1)] + 



(14) 



where [Q]+ = Q if Q > and [Q]+ = otherwise. 

The capacity region of the network is the set of all arrival 
rates v for which there exists a scheduling algorithm that can 
stabilize the queues, i.e., the queues are bounded in some 
appropriate stochastic sense depending on the arrival model 
used. In this paper, stability refers to the positive recurrence 
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of the Markov chain. It is known (e.g., [21]) that the capacity 
region is given by 

A° = {v > | 3/i £ Co(fl), v < ju}, (15) 

where Co(Vt) is the convex hull of the set of feasible schedules, 
i.e., (j, £ Co(O) if /x = Xltren^ "' where X)<r*f — 1 an d 
t<j > can be viewed as the fraction of time that schedule a is 
used. When dealing with vectors, inequalities are interpreted 
component-wise. We also define 

A := {v > | v £ Co(£l)}. 

We use the parallel Glauber dynamics to generate the 
transmission schedule a(t) in every time slot. We will show 
that a small mixing time of PGD implies a small average queue 
length in a wireless network under PGD-CSMA. By Little's 
law this also implies a small average delay in the network. 

A. Throughput and Fugacities 

The following theorem is slightly extended from iflOl . 

Theorem 5: Given any v £ A°, there exist suitable fu- 
gacities A such that for every link i, its mean service rate 
Si := 2~2a-iea 7r(c) is equal to the mean arrival rate Vi under 
PGD-CSMA with fugacities A, where ir(a) is given in 0. In 
other words, the service rates of PGD-CSMA can exactly meet 
the arrival rates at all links. Denote r* :— log(Ai). The vector 
r* = (r*) is the solution of the convex optimization problem 
max r F(r; u) where 

F ( r ; t/ )=Yl Viri ~ lo s(Xl ex P£ Vtn))- (16) 

i ctGO i 

Remark 3: A way to understand this result is that 
dF(r*;u)/dri = Vi - Si(r*) = 0,Vi, where s»(r*) is the 
mean service rate of link i with the fugacity Xj — exp(r*), Vj. 

Now we characterize the suitable fugacities A when v is in 
a fraction of the capacity region. 

Definition 4: The interference degree Xi of a link i is the 
maximum number of links in its conflict set Mi that can be 
active simultaneously. The interference degree of G is defined 
as X = max ie y Xi- 

Lemma 2: For an arrival rate vector u £ A°, let \{u) be 
the vector of fugacities such that the mean service rate s, is 
equal to the mean arrival rate Vi for every link i under PGD- 
CSMA. If v £ pA° (which means that -v £ A°) for some 

p < —, then 

r — x 

Ai(^) < V*. (17) 

Proof: For easy notation we write A(i/) as A in this proof. 
Under PGD-CSMA with fugacities A, let p ifi be the steady- 
state probability that no link in A/i is transmitting, i.e., 

Pi.o = 7r ( cr )- 

Using ([TJ, it is not difficult to show that (a detailed derivation 
is given in Appendix IB1 

v% = Si = ^ Pi l0 . (18) 



Since 1 — p^o is the probability that at least one link in A/j 
is transmitting, a union bound yields 

l-j*,o< E s i= E v r (19) 

On the other hand, note that u' := -u G A°, hence there 
exists a scheduling algorithm which can serve v' . Under that 
scheduling algorithm, 1 — v[ is the fraction of time link i is 
idle. The links in A/i can be served only when link i is idle, 
and in this case at most \ of them can be served. Therefore, 

£^< X(l - v'i)- (20) 

Combining ( TT9b and (l20b we have 

l-Pi,o< J2 ^<PA(1--)<1-- (21) 

since p < — . Hence vi < ppi.o which implies . 7I < p and 

X 1 1- 1 "i 

^ < 1^. ■ 

Note that for any graph its interference degree x is smaller 

than or equal to its maximum vertex degree A, hence we have: 
Corollary 2: If the arrival rate vector v £ then 

Ai(i>) < ^-y, Vi (22) 

B. Delay Performance of Fixed-Parameter PGD-CSMA 

Consider a wireless network with n links and suppose that 
the maximum degree of the interference graph is A which is 
independent of n. The queue length of link i, Qi(t), follows 
the dynamics in (fT4l) . 

Theorem 6: If the arrival rate vector u 6 ph° for some 
constant p < t, f/;e« f/iere exist fugacities A smc/i f/iaf /n f/;e 
steady state, the expected queue length E[Qi(t)] = O(logn) 
wnder PGD-CSMA with A. 

Proof: First, we have 1 e A. This is a direct conse- 
quence of Theorem 1 in [7 |. Choose a constant e' G (0, i— p). 
Then ^pjl £ e'A. Next we state a lemma used in |20|. 

Lemma 3: If u a £ p a A ant/ i/^ £ P&A , then v a + 1/5 G 
(Pa + Pb)A. 

Since 1/ £ pA°, there exists > 1/ such that /1 £ pA. By 
Lemma[3] we have /1 + ^jl G {p + e')A. So u + £ 
(p + e')A°G ^A. 

Let A be the fugacities such that Sj = vi + > t-j,Vi, 
where Sj is the mean service rate for link i under PGD-CSMA 
with (fixed) fugacities A0 By Lemma [2] we know that Aj < 
(p + e')/[l - (/o + e')] < 1/(A - 1), Vi. Therefore the PGD 
has a mixing time of 0(log(n)). 

From (TBI and since aj(t + 1) > 0, we have 

Qi(t + 1) < [Qi(t) - cr,(t + 1)]+ + Oi(t + 1). (23) 

3 In order to find A that yields the desired service rates Sj's, we can use the 
(fully-distributed) adaptive CSMA algorithms with diminishing step sizes as 
proposed in 1141 . II 11 . |9 |. In these algorithms, the links dynamically adjust 
the fugacities and make them converge to the proper A. In this subsection, 
we quantify the expected queue lengths after such A is found and fixed. 



7 



More generally, consider T consecutive time slots beginning 
from slot t, we have 

T T 

Qi(t + T) < [Qi{t)-J2<ri(t + k)]+ + J2 ai ( t + V 



k=l 



fc=l 



[Qi^-TSih+Tai 



(24) 



where a, = y X)fc=i a,i(t + k) and Sj = y X)fe=i "i(f + &) are 
the average arrival and service rates during the T slots. The 
RHS of d24"l i can be viewed as the virtual queue length after 
T slots if we assume arrivals to link i during the T slots occur 
at the end of time slot t + T, which is clearly larger than or 
equal to the actual queue length Qi(t + T). 

From (l24l and note that < a,, s, < 1, we have 

Qi(t + T) < ([Q^-Ts^+lVf 

< [Qi{t)-Tsif + 2Ta l Q l {t)+T 2 a 2 

= Q?(t)+2TQ i (t)(a i -s i ) + T 2 (a 2 + s 2 ). 

< Q 2 {t) + 2TQi{t) (&i ~ + 2T 2 . (25) 

Note that (Q(i),cr(t)) evolves as a Markov chain. Define 
the Lyapunov function L(t) = From d25l ) and 

since E[Qi(t)si\Q(t),<r(t)] = Q l (t)E[s l \a(t)}, we have 

E[L(t + T)-L(t)\Q(t),a{t)] 
< Tj2Qi(t)("i - E[sMt)}) + nT 2 . (26) 

i 

Now let us compute E[si\a(t)} for every link i: 
1 T 

£[*K*)] = ^%(t+^)Mt)] 

fe=i 
1 T 

= 5; ^Pr(a i (t + fe) = l|(7(t)) 
fc=i 

1 T 

fc=l <r£fi:<Ti = l 

where ^ a [t),k is the distribution of the Markov chain of the 
schedules after k slots if the Markov chain starts with schedule 
o-(f). Remember s; = I]^^.^ 7r(er), so 

\E[Si\<r(t)] - ml 
1 T 



fc=l aefl:<Ji = l 
T 



- X M<7(t),fc(o-) - XI 7r ( <j ) 

fe=l o-ef2:a-i=l 
1 T 

- y / J llM<r(t),fc ~ 7r||t)o 



fe=l 



Since A; < -531' from Theorem|2]and Corollary Q] we know 
that the parallel Glauber dynamics under A is fast mixing, and 
we can find D = n£ and /3 = 1 — < 1 such that 

I \l*a(t),k ~ n\\var < min{l, Dj3 k }. 



Let T = Lts^J so £>/3 To+1 < 1, we have 



\E[siW(t)} - Si 



< 



1 To 
fe=i 



< 







1 

1-/3 



T 



< 



fc=T + l 

J- n 



T 



(27) 



where T mix = 



log(-De) 
1-/3 



Y3g. If we choose T = 
— Si I < e and 



[2km] f or sorri e e > 0, then |E[sj|cr(f)] - 

£[s*H*)] >«i-e,Vi. (28) 
Plugging into ( 1261 ), we have 

E[L(t + T)-L(t)\Q(t),a(t)} 
< -T £ Qi (*) (*< - «4 - e) + 

where the right-hand-side is negative if e < mirij(sj — Kj) 
and ||Q(f)|| is sufficiently large. This establishes the negative 
drift of L(t). By the Foster-Lyapunov criterion, (Q(f),cr(f)) 
is positive recurrent. 

In the steady state, taking expectations of both sides of ( T25T ) 
and using d28l ), we have 

E[Q 2 (t + T) - Qj{t)] = < -2TE[Qi(t)]( Si - u t - e) + 2T 2 



which implies 

E[Qi(t)} < — 



T 



< 



1 



^ - ej 



^(T m i x ~l~ 1) 

(Si 



V; 



(29) 



and T = |" m " ] in our analysis of 
Vi = e'/(A + 1) is independent of n. 



by choosing e = — 
link i. Recall that s 
Also, as we have proved in Section IIV-BI for bounded-degree 
interference graphs, T mix = O(logn). These facts combined 
imply that E[Q t (t)} = O(logn) under PGD-CSMA. ■ 

C. Delay Performance of Dynamic-Parameter PGD-CSMA 

In this subsection we consider PGD-CSMA with dynamic 
parameters (fugacities). That is, unlike the last subsection 
where we assumed that suitable fugacities have been found 
and fixed, here the fugacities are dynamically adjusted based 
on the local queue length information. 

Given an interference graph G with n links. Suppose G has 
a maximum degree A and an interference degree \ which are 
all independent of n. 

Let B be such that exp(B) < — Assume that u G 

i+cx ( p^B e ) AO where B £ := B - e e (0, B), and that each 
element > v m i n for some constant v m i n > 0. 

Proposition 1: Under the above assumptions of v, we have 

v m in < Ai(f) < exp(B e ),Vi. 

Proof: By Lemma|2] we have \ < exp(B e ). Also, by 
(fT8l . one has i>i = Sj < Aj(i/)/[l + \i(y)\. Since Vi > v m in, 
we have Xi(u) > v min . ■ 
Remark 4: Let r* = argmax r F(r; v) where F(r;u) is 
defined in H16h Since r* — log(A,(i/)), we have r* G 
[r m in,B e ],Vi where r min := log(iv min ). 



s 



We further select B such that if \ k < exp(-B), Vfc, then the 
parallel Glauber dynamics is fast mixing and the mixing time 
is upper-bounded by T m i X = O(logn) (by Corollary Q] this 
can be achieved when exp(B) < xzt). Now we propose an 
algorithm to dynamically adjust the fugacities of PGD-CSMA. 

Algorithm 1: The fugacities of the links, denoted by the 
vector A, are updated every T time slots, where 



T 



T 



An ■ (B 



a) 



0{n log n) (30) 



with a := S/n and 

exp(r) 



6 :-- 



exp(B e ) 



l r =B c l + exp(r) l + exp{B e ) 

Specifically, at the end of slot jT,j = 0,1 
updates its fugacity to be 



dr > 0. (31) 
, link k 



\ k [j] = exp(min{r fe [j], J B}),V/c 



with 



a 



r k[j] ■= 7pQk[j] +r„ 



a 



(32) 



(33) 



where Q k [j] is the queue length of link k at the end of slot jT, 
i.e., Qk[j] = Qk{jT). Note that Q k (t) follows the dynamics 
( TT4"1 >. Also note that the fugacity vector A[j] is used for T time 
slots (from slot jT + 1 to (j + 1)T, which we call "frame j"). 

Remark 5: By ( 132 D and ( 155P , link k increases its fugacity 
when its queue length increases (unless the fugacity has 
reached exp(B)). So link k transmits more aggressively when 
its queue builds up. Also, since Xk[j] < exp(_B),Vfc at all 
time, PGD is fast mixing in each frame of T slots. 

Remark 6: Algorithm 1 is designed such that r[j] is at- 
tracted towards r* £ [r m i n ,B f ] n . So, by ( |33| ), the queue 
lengths are attracted towards an affine function of r* and are 
therefore stabilized; and by ( 1521 ), \[j] is attracted to X(f). 



Algorithm 1 and the consequent proof techniques are quite 
different from existing works (e.g., |9]). Specifically, unlike 
|9l , the fugacities in Algorithm 1 are direct functions of the 
queue lengths. Also, we derive a polynomial delay bound 
instead of an exponential bound in [9 1. To do that, we apply the 
mixing time results in Section [III] and use a novel Lyapunov 
function in the stability proof. 

Theorem 7: The queue length vector Q(t) is stable (i.e., 
positive recurrent) under Algorithm 1. 

We first need a lemma. 

Lemma 4: For any vector r with some element > B, we 
have F(r; v) < F(r*; v) — 5, where S > is defined in 07] ). 
Note that 5 is independent of n. 

Proof: We first show that F^r; u) < F(r*;u) — S if r fc = 
B for some k. Denote 

F k (fk]v) = maxF(r_ fe ,r fe =f k ;v). (34) 

Then clearly F(r; v) < Fk(B; v). So it is sufficient to prove 
F k {.B-u) <F(v*;u)-S. (35) 



Denote the solution of RHS of ( [341 by f_/ £ (r/ c ), and let 
r(ffe) := (r k = r fc ,r_ fc = f_ fe (f fc )). Then, the envelope 
theorem implies that 

dFk{r k ]v)/dr k = dF(r(f k );v)/dr k = v k -s k (v(f k )). (36) 

By the definition of r^ k (f k ), we know that for any k' ^ k, 

dF(v(r k );v)/dr k , = vy - s/ s /(f(f/ ! )) = 0, so s fc /(f(f fc )) = 
v k >. Therefore, s(f(f fe )) - v = (s fc (r(f fc )) - v k ) ■ e k . 

Note that v G t^^ A. Given a r k £ (B € ,B], by 

Lemma|2]we know that s(r(f k )) ^ pA for any p < ■ 

So s fc (r(f fc )) - v k > 0. 

Since (s k (r(f k )) - v k ) ■ e k E (s k (r(f k )) - v k )K, using 
Lemma [3] we have s(r(f fc )) = u + (s k (r(f k )) - v k ) ■ e k G 
[ l+cxp^B 1 + S fe( f '( f fe)) _ v k]A. Since s(f(f fe )) ^ pA for any 
P < S). ^ must be that s k (r(r k )) u k > 



L+cxp(r fc ) 



i+ZSb.) - Usin 8 69. one has dF k (f k ;is)/dr k < rg^y 



l+cxp_(B e ) 

cxp(r fc ) 
l+exp(r fc ) ' 

Therefore, we have 



F k {B;u) 
= F k (B £ ;v)- 

< F(r*;v)- 



exp(S £ ) 



exp(r fc ) 



- 1 + cxp(B e ) 1 + exp(r fe ) 
exp(ffe) exp(B e ) 



]dr k 



f k =B c 1 



exp(r fc ) l + exp(_B e ) 



dr k . 



So, (|35) holds with S defined in d3B . 

Finally, we need to show that F(t;v) < F(r*;v) — 8 if 
f k > B for some k. Given such a f, one can find a ?' on 
the line segment between f and r* to satisfy f' k — B. We 
already know that F(r'; v) < F(r*; v) — 5. By the concavity 
of F(-; u), we have F(r; v) < F(r*;u) - S. ■ 

Now we are ready to prove Theorem [7] 

Proof: (Theorem [7]i We first show that r[j] is stable. 
Equation d33l implies that r k [j) > r m i n — a,Vk,j. Denote 



r kiJ\ 



mm{r k [j],B}. 



Then X k [j] = exp(r' k [j}), and r'[j] e B := [r r< 
We define a Lyapunov function 

L(v) :=^i fc (r fe ) 



(37) 



where 



L k (r k ): = (B-r* k )(r k -r* k )I(r k >B) + 



Then, 
8L(t) 



dr k 



l -[(r k -rlf + (B-rlf]I{r k <B). 



(B - r* k )I(r k >B) + (r k - r* k )I(r k < B). (38) 



For simplicity, we write F(r; u) as F(r). For a r G [r m i n - 
a, oo) n but r ^ B, let r' be its projection on B (i.e., r' k - 
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Fig. 2. An example when n = 2. 

minjrfc, B}, Vfc. See Fig. [2] for an illustration). Then by the 
concavity of F(r) we have 



k 



8F{r') dL(r) 
dr' dr k 



k "'- 



< F(r') - F(r*) < -8 (39) 



where the last step has used Lemma 0] 

We then claim that given any r[j] G [r m i n — a, oo) 

L k {r k \j + 1}) <L k (r k \j + l]),Vk 



(40) 



where f k [j + l] := r fc [j] + a[a fc [j]-s fc [j]], with a fe [j] denoting 
the average arrival rate from slot jT+ 1 to (j + 1)T, and s k [j] 
denoting the average service rate in the same interval with the 
fugacity vector X[j]. 

To show d40b we consider two cases: 

(i) If from slot jT + 1 to (j + 1)T, queue k is not empty 
whenever it is scheduled to transmit in the CSMA protocol, 
then Q k [j + 1] = Q k \j] + T[a k \j] - s k []}}- By (El, we have 
rk\j+l] = r k \j]+a[a k \j]-s k \j}] = f k [j + l}. Then L k (f k [j + 
l]) = L k (r k \j + l]). 

(ii) Otherwise, one has 

Qk[j + 1] > Qk[j] + T[a k [j] - s k [j']]. (41) 

Inequality (|24|) is equivalent to Q k [j + 1] < [Q k [j] - T ■ 
s k [j}}+ +T-a k [j]. Suppose that Q k [j + 1] > T, then [Q k [j] - 
T ■ s k >Q k [j + l]~T- a k [j]>T-T- a k [j] > 0, which 
implies that Q k [j] > T ■ s k [j]. But if this holds, queue k never 
gets empty from slot jT + 1 to (j + l)T, contradicting the 
assumption. Therefore, we have Q k [j + 1] < T. Using this, 
d4Tb and d33l . one has 

?k[j + 1] < r k [j + 1] < 

which implies d40l > since r k > r m i n . This completes the proof 
of (00). 

Inequality ( f4Qb immediately implies that 



L(r[j + 1]) <L(f[j + l]). 



(42) 



Let Tj be the c-field generated by {Q[j'], r[/], cr[j']}, j' = 
0, 1, 2, ... , j where cr[j'] = fj(j'T) is the state of the CSMA 
Markov chain (i.e., the schedule used in time slot j'T). In 
the following, we write the conditional expectation E(-\Tj) 
simply as Ej(-). Using Taylor expansion, 



< 
< 

< 



E [L{v[ ] + l])-L{v[j])] 
^[L(f[j + 1])-L(r[j])] 



k 



{Wk- Ej (s k[M d -^\ 



dr k [j] J 
dL{v\j]) 



«E{K-«(^1)]^f} 



dL(r[j}) 



;i / 



-na 



dr k [j\ 

By d30j and d27]), we have 

l^i [**[?']] ~ s fe (r'L7'])| < S/(4n ■ d max ),Vj,k 



k 

(43) 



(44) 



where d max := B - r min + a. 

If r[j] ^ B, we use (03), (031) and ([39]l to derive the 
following: 



a 



E 3 [L{v[ 3 + l])-L{v\j])} 
^ dF(rWdL{r\jJ} 



4n • eL 



--na 
2 



<E 



dF(v'[j})dL(r[j}) 
dr'Aj] dr k [j] 



a— 

4 



— na 
2 



< af-<5 + ~ + -na) = -a6/4 
4 2 

which establishes the negative drift of L(r[j]). 

By the Foster- Lyapunov criterion, r[j] is stable, and by d33l . 
Q[j] is also stable. Since in each time slot, the change of each 
queue length is at most 1, we conclude that Q(<) is stable. ■ 

Theorem 8: Suppose that the arrival rate vector u € pA° 
where p < and each element v k > v m i n . Let B e = 

log^) so that v € ^ffj^ A . Let B = log^) > B e 

where the constant p G (p, so that if X k < exp(B) = 

p , < -^zj i the parallel Glauber dynamics has a mixing 
time of 0(log(n)). Then, the queue lengths under Algorithm 
1 satisfy 

~ (45) 



k 



h =0(n 3 logn) 



where Q k :— limsupj^^^ Y2t=i E(Q k (t))/M is the average 
expected queue length at link k. 

Proof: Denote L max rg g ^(r). Then if L(r) > L, we 
have r ^ B. Define 

G(r) := [L(t)-L]+. 
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Note that |r fe [j + l] -r fc [j]| < a,\fk. Also, <[38j implies that Since B — r%> B - B e = e, Vfc, we have 



|^| <d max . Then 

\L{v\j + l]) -L(r[j])| < nad 

Case 1: If L(r[j}) - L > c, then L(r[j + 1]) - L > 0, and 

G 2 (r[ J + l])-G 2 (r[j]) 
= [L(r[.7 + l])-Z] 2 -[L(r[j])-Z] 2 
= 2[L(r\j])-L}[L(r\j + l])-L(r\j])] + 

[L(r[ J + l])-L(r[j])] 2 
< 2G(r[j])[L(rb- + l])-L(r[j])]+ C 2 . 

Therefore 

^■[G 2 (rb' + 1])-G 2 (rb-])] < 2G(r\j])A\j] + c 2 

< -G(r[j])a(5/2 + c 2 . 

Case 2: If G(r[j]) < c, then < G(r[j + 1]) < G(r[j]) + c. 
Therefore 

^■^(rb' + lD-G 2 ^])] 

< 2cG(r[j]) + c 2 < 3c 2 

< -G(r[j])c«5/2 + caS/2 + 3c 2 . 

Combining case 1 and 2, we have 

^■[G 2 (r[j + 1])-G 2 (r[j])] < -G(v\j])a6/2 + caS/2 + 3c 2 . 

Taking expectations on both sides yields 

£[G 2 (r[j + l])-G 2 (r[j])] < -E[G(v[j})}aS/2+caS/2+3c 2 . 

Summing the above inequality from j — to j = J — 1, 
and dividing both sides by J, we have 

£[G 2 (r[J])-G 2 (r[0])]/J 

.7-1 

< -(aS/2) E[G(r\j])]/J + cad/2 + 3c 2 . 

3=0 



Therefore, 

.7-1 



]imsupJ2E(G(r\j]))/J< 

.7 — >• co 



cac5/2 + 3c 2 



k 

J-l 

Urn sup E & k W)/ J 

J-l 



J— >oo 



< - lim sup ^ ^ (£? - r* k )Q k \j)/J = 0(n 3 log n) . 



e J 



3=0 k 



3=0 

Note that 



aS/2 = C+6n P = ° {n) - 



Since in a slot each queue is increased at most by 1, Qk < 
limsupj^ E^o E(Q h \j])/J + T where T = 0(n log n). 
Therefore (05) holds. ■ 

VI. Complete Interference Graphs 

We have shown that in general interference graphs, PGD- 
CSMA can achieve polynomial queue lengths when the arrival 
rate vector lies in of the capacity region. In this section, 
we will show that the polynomial-delay region can be further 
improved for certain interference graphs. We only consider 
fixed-parameter PGD-CSMA for simplicity. 

Consider a wireless local area network where every mobile 
station maintains a 1-hop link to the Access Point. This is an 
important scenario in practice, where all links conflict with 
each other and the interference graph is a complete graph. 

In such a complete interference graph with n links (note 
that this graph does not have a bounded degree as n — > oo), 
suppose the fugacity of link i is A^. Under PGD-CSMA with 
fugacities A, the steady-state probability that link i is active 
is simply 

ne 4 ) = y = s h 

where is the schedule with only link i active, Z = 1 + 
T^j=i an< ^ Si ^ s trie mean service rate of link i. Also, denote 

Amaa: • — IllclXj Aj. 

To determine the decision schedule in each time slot, similar 
to the scheme in [16], each link independently sends an 
INTENT message with probability 1/n. If the transmitter of 
link i sends the packet and the packet is successfully received 
(indicated by an acknowledgement from the receiver), then 
link i is included in the decision schedule. So, the decision 
schedule which only includes link i is chosen with probability 



W(r) := 53(£ - r*)(r fc - r*) < L(v) < G(r) + L. 



1(1 _ I)n-i := W here c„ := (1 - i)"" 1 > c„ 
0.2, Vn; and the decision schedule which includes no links is 
chosen with probability 1 — c n . 

Lemma 5: Let \i x \t be the distribution of the transmission 
schedule in slot t with any initial schedule x' , we have 



So 



life' 



(46) 



j-i 



lim sup 53 E (W{r\j]))/J < 0(n) + L — 0(n) 



J— >oo 



3=0 



sinceL = ELi(5-^) 2 = 0(n). 
In view of d33l ), we then have 



where 7=1 n c "V" — r. 

Proof: Consider two copies of the Markov chain X(t) and 
Y(t). We will construct a coupling {X(t),Y(t)}, with Y(0) 
chosen from the stationary distribution n. With this coupling, 
we show that for any x, y, 



J-i 



lim sup 53 J2( B -r*kMQk\j])/J - Q{-n) = 0(n 3 log 



P(X(t + 1) = Y(t + l)\X(t) = x, Y(t) = y) 



J— >oo 



3=0 fe 



> c:= 



/[n ■ (1 + Xmax)}- 



(47) 



II 



Since there is at most one link active in a given slot, for 
convenience, we write the state as i if link i € {1,2, ... ,n} 
is active in the slot, and if no link is active. 

If x — y, then the two Markov chains have already coupled 
in slot t, so that P(X(t + 1) = Y(t + l)\X(t) = x,Y(t) = 
y) = 1. If x ^ y, there are three cases. 

(i) x = i\ and y = 12 where i\ ^ 12 and i\,i2 7^ 0. 
According to the Glauber dynamics defined above, with 

probability c n /n, one link is selected in the decision schedule, 
and with probability 1 — c„ no link is selected. We define the 
following coupling. 

If link ii is selected in the process X(-), then select link 12 
in the process Y(-). WLOG, assume that Xi 1 > \ 2 . Then, turn 
off link i\ in X(-) w. p. 1/(1 + A^). If link i\ is turned off. 
then also turn off link 12 in Y(-). If link i\ is not turned off. 
turn off link i 2 w. p. \yl{\+\ a )-l/{l+\ 1 )]/[\J{l+\ 1 )] 
Then it is easy to see that link i 2 is turned off w. p. 1/(1 + A.; 2 ) 

If link 12 is selected in X(-), then select link i\ in Y(-) 
Clearly, after the selection, they cannot be turned on. 

If a link other than i\ and 12 is selected in X(-), then select 
the same link in Y(-). Also, any link selected must remain off 
at time 1. 

If no link is selected in X(-), then also select no link in 
Y(-). 

Therefore, 

P(X(t + 1) = Y(t + l)\X{t) = x, Y(t) = y) 

> P(X(t + 1) = Y(t + 1) = 0\X(t) = x, Y(t) = y) 

> c min /[n- (1 + AjJ] 

(ii) x = i and y = where i ^ 0. 

In X(-) and Y(-), we choose the same link (in the decision 
schedule), and use the same coin toss to decide whether the 
chosen link should try to turn on or off. Therefore, w. p. c n /n, 
link i is chosen, and then X(t + 1) = Y(t + 1). So, 

P(X(t + 1) = Y(t + l)\X{t) = x, Y(t) = y) 

(iii) x = and y = i where i ^ 0. 
This is symmetric to case (ii). 

Therefore, in any case, (|47| | holds, and we have 

P(X(t+l) ? Y(t+l)\X(t) = x,Y(t) =y)< l-c,Vx,y,t. 
This implies that 

P(X(t) ^ Y(t)\X(0) = x') < (1 - c)*,W. 
By the well-known coupling lemma (e.g., Q), 

||/ix',t - 7r||„ Qr < P(X(t) / y(t)|X(0) = x 1 ) 

for any initial state x' , hence proving d46l >. ■ 
Theorem 9: Given any p < 1 (which is independent of n), 
PGD-CSMA can support u G pA with a mixing time of 0(n). 



Proof: For the complete interference graph, 

n 

A = H5> i <l,i/ i >0,Vj}. 
i=i 

Therefore, since v G pA, we have X)?=i — P- v ' := 
v + ^1 satisfies that y™, i/ H < < 1. As a result, 
I/' G G A°. 

Let A be the vector of fugacities such that Sj under PGD- 
CSMA is equal to v[ > vi at each link i. We have Sj = 



1/^, Vi and 



E"=i a j 
+ EJUA. 



= £4 



3=1 J j=i 



Therefore, A ma;r < X)j=i A? — jz^- Using Lemma [5] and 
(Hit , we know that the mixing time 



„ 1 _ re ■ (1 + A ma:z ) 2re 

^ 7712X - ^ 



1-7 



0(n). 



VII. Conclusion 

In this paper, we have shown that Glauber dynamics based 
CSMA can result in queue lengths that only grow polyno- 
mially in the network size in bounded degree graphs if the 
arrival rates lie within a certain fraction of the capacity region. 
This establishes a positive result in contrast to previous results 
which showed that polynomial queue lengths are not possible 
if the interference graph of the network can be arbitrary [20|. 
To establish our results, we use Markov chain coupling theory 
to estimate the mixing time of parallel Glauber dynamics. It 
is interesting that our upper bound on the queue lengths is 
larger (in an order sense) for the dynamic -parameter algorithm 
compared to the fixed-parameter algorithm. However, it is 
unclear whether this is an artifact of our bounding techniques 
or if it is an inherent penalty due to the time required for 
adaptation. It would be interesting to explore this issue in 
future work. We are also interested to study the throughput and 
delay performance of CSMA scheduling algorithms in other 
important classes of interference graphs, and design enhanced 
or new algorithms to further improve the performance. 

Appendix 

A. Other conditions for fast mixing of parallel Glauber dy- 
namics 



Corollary 3: Let m = mm ve y + - , M = max^gi/ 
and f = M . If 



= min < 1 + X v - 



> 0, 



(48) 



then we have 



T ■ < T ■ — 

mix _ mix — 



— log(n£e) 



(49) 



Proof: Choose f(v) = Vt> G V in Theorem |2] 
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Corollary 4: Let q n 
andi = ^.If 



mm veV q v , q v 



ma,x veV q v , [14] 



St 



A,, 



o — max , 



< 1, 



f/ien we /zave 



T 



_^ J- mix 



log (<e) 



(50) 



(51) 



Proof: Choose /(u) = — , Vt> E V in Theorem [2] ■ 

Remark 7: Note that the condition X v < d 1 _ 1 for all 
v E V in Corollary Q] might be very different from b = 
max v& vYlwEN l+i < * n Corollary^ e.g., in graphs 
of star topology. 

B. Proof of KM 

Let A := {a E VL\a t = l,aj = 0,Vj E A/"J. Then S; = 
EaeA^v)- Let B == W e fllo-i = 0,0-j = 0,Vj E M}. 
Clearly, .4 n S = 0. Define C = {a + e»|(7 E 23}, where e, is 
the n-dimensional vector whose i-th element is 1 and all other 
elements are 0. We claim that A = C. (Indeed, any a' E A can 
be written as a' = cr+e^ for some a E S. So a' E C. Also, any 
cr' E C is in A) Therefore, J2a>eA n i a ') = E^ee ^ + e 0- 
By ([T), we have 7r(o- + e j; ) = Aj7r(o"). So s, ; = ^l' 7 ') = 

A iE -eB 7r (°')' As a result ' Pi,0 = E CT :<r 3 =0,VjeM = 

E.'e^ ^(^') + E.ee t(") = i 1 + £K P rovin g OS- 
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